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Abstract 

Let H be a graph, and let Ch (G) be the number of (subgraph isomorphic) copies of H con- 
tained in a graph G. We investigate the fundamental problem of estimating Ch(G). Previous 
results cover only a few specific instances of this general problem, for example, the case when 
H has degree at most one (monomer-dimer problem). In this paper, we present the first gen- 
eral subcase of the subgraph isomorphism counting problem which is almost always efficiently 
approximable. The results rely on a new graph decomposition technique. Informally, the de- 
composition is a labeling of the vertices such that every edge is between vertices with different 
labels and for every vertex all neighbors with a higher label have identical labels. The labeling 
implicitly generates a sequence of bipartite graphs which permits us to break the problem of 
counting embeddings of large subgraphs into that of counting embeddings of small subgraphs. 
Using this method, we present a simple randomized algorithm for the counting problem. For 
all decomposable graphs H and all graphs G, the algorithm is an unbiased estimator. Further- 
more, for all graphs H having a decomposition where each of the bipartite graphs generated is 
small and almost all graphs G, the algorithm is a fully polynomial randomized approximation 
scheme. 

We show that the graph classes of H for which we obtain a fully polynomial randomized 
approximation scheme for almost all G includes graphs of degree at most two, bounded-degree 
forests, bounded-width grid graphs, subdivision of bounded-degree graphs, and major sub- 
classes of outerplanar graphs, series-parallel graphs and planar graphs, whereas unbounded- 
width grid graphs are excluded. Additionally, our general technique can easily be applied to 
proving many more similar results. 
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1 Introduction 

Given a template graph H and a ^ase graph G, we call an injection ip between vertices of H and vertices 
of G an embedding of if into G if </? maps every edge of -ff into an edge of G. In other words, ip is 
an isomorphism between H and a subgraph (not necessarily induced) of G. Deciding whether such an 
injection exists is known as the subgraph isomorphism problem. Subgraph isomorphism is an important 
and general form of pattern matching. It generalizes many interesting graph problems, including Clique, 
Hamiltonian Path, Maximum Matching, and Shortest Path. This problem arises in application areas ranging 
from text processing to physics and chemistry (8][3l[35][29l. The general subgraph isomorphism problem is 
NP-complete, but there are various special cases which are known to be fixed-parameter tractable in the size 
of H (2). 

In this work, we consider the related fundamental problem of counting the number of copies of a tem- 
plate graph in another graph. By a copy of H in G we mean any, not necessarily induced subgraph of 
G, isomorphic to H. In general the problem is #P-complete (introduced by Valiant ll37ll ). The class #P is 
defined as {/ : 3 a non-deterministic polynomial time Turing machine M such that on input x, the compu- 
tation tree of M has exactly f(x) accepting leaves}. Problems complete for this class are presumably very 
difficult, especially since Toda's result ll36l implies that a call to a #P-oracle suffices to solve any problem 
in the polynomial hierarchy in polynomial time. 

Fixed-parameter tractability of this counting problem has been well-studied with negative results for 
exact counting ifTTl and positive results for some special cases of approximate counting Q. In this paper, 
we are interested in the more general problem of counting copies of large subgraphs. Exact counting is 
possible for very few classes of non-trivial large subgraphs. Two key examples are spanning trees in a graph, 
and perfect matchings in a planar graph [27]. A few more problems such as counting perfect matchings in 
a bipartite graph (a.k.a. (0-1) permanent) |[25l . counting all matchings in a graph [24], counting labeled 
subgraphs of a given degree sequence in a bipartite graph [5 ], counting combinatorial quantities encoded by 
the Tutte polynomial in a dense graph HI, and counting Hamilton cycles in dense graphs 0, can be done 
approximately. But problems like counting perfect matchings in general graphs are still open. 

Since most of the other interesting counting problems are hopelessly hard to solve (in many cases 
even approximately) l22l . we investigate whether there exists a fully polynomial randomized approxima- 
tion scheme (henceforth, abbreviated as fpras) that works well for almost all graphs. The statement can be 
made precise as: Let G n be a graph chosen uniformly at random from the set of all n-vertex graphs. We say 
that a predicate V holds for almost all graphs if Pr[P(G n ) = true] — > 1 as n — > 00 (probability over the 
choice of a random graph). By fpras we mean a randomized algorithm that produces a result that is correct 
to within a relative error of lie with high probability (i.e., probability tending to 1). The algorithm must run 
in time poly(n, e^ 1 ), where n is the input size. We call a problem almost always efficiently approximable 
if there is a randomized polynomial time algorithm producing a result within a relative error of 1 ± e with 
high probability for almost all instances. 

Previous attempts at solving these kinds of problems have not been very fruitful. For example, even 
seemingly simple problems like counting cycles in a random graph have remained open for a long time 
(also stated as an open problem in the survey by Frieze and McDiarmid lfl4l ). In this paper we present new 
techniques that can not only handle simple graphs like cycles, but also major subclasses of more complicated 
graph classes like outerplanar, series-parallel, planar etc. 

The theory of random graphs was initiated by Erdos and Renyi iflOl . The most commonly used models 
of random graphs are Q(n,p) and Q{n,m). Both models specify a distribution on n-vertex graphs. In 
G(n,p) each of the (2) edges is added to the graph independently with probability p and Q(n, m) assigns 
equal probability to all graphs with exactly m edges. Unless explicitly stated otherwise, the default model 
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Function Count: Assume Pi > for all i and J2i=iPi^^- 

If some element X{ is picked (with probability pi) then output Z = — 
Else output Z = 

Figure 1: Estimator for the cardinality of S. 

addressed in this paper is Q(n,p). 

There has been a lot of interest in using random graph models for analyzing typical cases (beating 
the pessimism of worst-case analysis). Here, we mention some of these results relevant to our counting 
problem (see the survey of Frieze and McDiarmid lfl4ll for more). One of the most well-studied problem is 
that of counting perfect matchings in graphs. For this problem, Jerrum and Sinclair [23 ] have presented a 
simulation of a Markov chain that almost always is an fpras (extended to all bipartite graphs in [25 ]). Similar 
results using other approaches were obtained later in lTT2l[3Tl l6l [T7l . Another well-studied problem is that of 
counting Hamiltonian cycles in random digraphs. For this problem, Frieze and Suen [15 ] have obtained an 
fpras, and later Rasmussen [31 ] has presented a simpler fpras. Afterwards, Frieze et al. [ 13] have obtained 
similar results in random regular graphs. Randomized approximation schemes are also available for counting 
the number of cliques in a random graph [32]. However, there are no general results for counting copies of 
an arbitrary given graph. 

1.1 Our Results and Techniques 

In this paper, we remedy this situation by presenting the first general subcase of the subgraph isomorphism 
counting problem which is almost always efficiently approximable. For achieving this result we introduce a 
new graph decomposition that we call an ordered bipartite decomposition. Informally, an ordered bipartite 
decomposition is a labeling of vertices such that every edge is between vertices with different labels and 
for every vertex all neighbors with a higher label have identical labels. The labeling implicitly generates 
a sequence of bipartite graphs and the crucial part is to ensure that each of the bipartite graphs is of small 
size. The size of the largest bipartite graph defines the width of the decomposition. The decomposition 
allows us to obtain general results for the counting problem which could not be achieved using the previous 
methods. It also leads to a relatively simple and elegant analysis. We will show that many graph classes 
have such decomposition, while at the same time many simple small graphs (like a triangle) may not possess 
a decomposition. 

The actual algorithm itself is based on the following simple sampling idea (known as importance sam- 
pling as statistics): let S = {x\, . . . , x z } be a large set whose cardinality we want to estimate. Assume that 
we have a randomized algorithm that picks each element xi with non-zero known probability pi. Then, the 
function Count (Fig. [I]) produces an estimate for the cardinality of S. The following proposition shows that 
the estimate is unbiased, i.e., E[Z] = \S\. 

Proposition 1. The function Count is an unbiased estimator for the cardinality of S. 

Proof. It suffices to show that each element Xi has an expected contribution of 1 towards \S\. This holds 
because on picking Xj (an event that happens with probability p{), we set Z to the inverse probability of this 
event happening. Therefore, K[Z] = J2iPi ' ^ = \&\- ^ 

Similar schemes of counting have previously been used by Hammersley |[T9l and Knuth EST! in other 
settings. Recently, this scheme has been used by Rasmussen for approximating the permanent of a (0-1) 
matrix (31], and later for approximately counting cliques in a graph [32]. A variant of this scheme has 
also been used by the authors to provide a near linear-time algorithm for counting perfect matchings in 
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random graphs lfT6l[T7l . This is however the first generalization of this simple idea to the general problem 
of counting graph embeddings. Another nice feature of such schemes is that they also seem to work well in 
practice ll34l . 

Our randomized algorithm will try to embed H into G. If the algorithm succeeds in finding an em- 
bedding of H in G, it outputs the inverse probability of finding this embedding. The interesting question 
here is not only to ensure that each embedding of H in G has a positive probability of being found but also 
to pick each embedding with approximately equal probability to obtain a low variance. For this purpose, 
the algorithm considers an increasing subsequence of subgraphs H\ C H2 C • • • C Hi = H of H. The 
algorithm starts by randomly picking an embedding of H\ in G, then randomly an embedding of H2 in G 
containing the embedding of H\ and so on. It is for defining the increasing sequence of subgraphs that our 
decomposition is useful. 

The algorithm is always an unbiased estimator for Ch{G). The decomposition provides a natural suf- 
ficient condition for the class of algorithms based on the principle of the function Count to be an unbiased 
estimator. Additionally, if the base graph is a random graph from Q (n, p) with constant p and if the template 
graph has an ordered bipartite decomposition of bounded width, we show that the algorithm is an fpras. The 
interesting case of the result is when p = 1/2. Since the Q(n, 1/2) model assigns a uniform distribution 
over all graphs of n given vertices, an fpras (when the base graph is from Q{n, 1/2)) can be interpreted as 
an fpras for almost all base graphs. This result is quite powerful because now to prove that the number of 
copies of a template graph can be well-approximated for most graphs G, one just needs to show that the 
template graph has an ordered bipartite decomposition of bounded width. 

The later half of the paper is devoted to showing that a lot of interesting graph classes naturally have 
an ordered bipartite decomposition of bounded width. Let Ck denote a cycle of length k. In this extended 
abstract, we show that graphs of degree at most two, bounded-degree forests, bounded-width grid (lattice) 
graphs, subdivision of bounded-degree graphs, bounded-degree outerplanar graphs of girth at least four, 
and bounded-degree [C3,Cs]-free series-parallel graphs, planar graphs of girth at least 16 have an ordered 
bipartite decomposition of bounded width. Using this we obtain the following result (proved in Theorems [3] 
and©. 

Theorem 1 (Main Result). Let H be a simple graph where each connected component is one of the fol- 
lowing: graph of degree at most two, bounded-degree tree, bounded-width grid, subdivision of a bounded- 
degree graph, bounded-degree C^-free outerplanar graph, bounded-degree [C^,,C^\-free series-parallel graph, 
or planar graphs of girth at least 16. Then, for almost all graphs G, there exists an fpras for estimating the 
number of copies of H in G. 

Even when restricted to graphs of degree at most two, this theorem recovers most of the older results. 
It also provides simpler, unified proofs for (some of) the results in fT2l I3T1 l6l [T31l . For example, to count 
matchings of cardinality k one could use a template consisting of k disjoint edges. Similarly, to count all 
cycles of length k the template is a cycle of that length. By varying k and boosting the success probability, 
the algorithm can easily be extended to count all matchings or all cycles. This provides the first fpras for 
counting all cycles in a random graph (solving an open problem of Frieze and McDiarmid [HI). We omit 
further discussion of this problem. 

For template graphs coming from the other classes, our result supplies the first efficient randomized 
approximation scheme for counting copies of them in almost all base graphs. For example, it was not 
known earlier how to even obtain an fpras for counting the number of copies of a given bounded-degree tree 
in a random graph. For the simpler graph classes the decomposition follows quite straightforwardly, but for 
graph classes such as subdivision, outerplanar, series-parallel, and planar, constructing the decomposition 
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requires several new combinatorial/algorithmic ideas. Even though our techniques can be extended to other 
interesting graph classes, we conclude by showing that our techniques can't be used to count the copies of 
an unbounded-width grid graph in a random graph. 

2 Definitions and Notation 

Let Q be some function from the set of input strings S* to natural numbers. A fully polynomial randomized 
approximation scheme for Q is a randomized algorithm that takes input iGE' and an accuracy parameter 
e £ (0,1) and outputs a number Z (a random variable depending on the coin tosses of the algorithm) such 
that, 

Pr[(l - e)Q(x) <Z<(1 + e)Q(x)} > 3/4, 

and runs in time polynomial in \x\, e _1 . The success probability can be boosted to 1 — 5 by running the 
algorithm 0(log <5 _1 ) times and taking the median [26]. 

Automorphisms are edge respecting permutations on the set of vertices, and the set of automorphisms 
form a group under composition. For a graph H, we use aut(H) to denote the size of its automorphism 
group. For a bounded-degree graph H, aut(H) can be evaluated in polynomial time [30]. Most of the other 
graph-theoretic concepts which we use, such as planarity are covered in standard text books (see, e.g., 171). 

Throughout this paper, we use G to denote a base random graph on n vertices. The graph H is the 
template whose copies we want to count in G. We can assume without loss of generality that the graph H 
also contains n vertices, otherwise we just add isolated vertices to H. The number of isomorphic images 
remains unaffected. Let A = A(iT) denote the maximum degree of H. 

For a graph F, we use Vf to denote its vertex set and Ep to denote its edge set. Furthermore, we 
use vf = \Vf\ and ej? = \Ep\ for the number of vertices and edges. For a subset S of vertices of F, 
N F (S) = {v e V F - S : 3u £ S such that (u,v) e E F } denotes the neighborhood of S in F. F[S] 
denotes the subgraph of F induced by S. 

We use Ch(G) to denote the number of copies of H in G. Let Ljj{G) = Ch{G) ■ aut(H) denote 
the number of embeddings (or labeled copies) of H in G. For a random graph G, we will be interested in 
quantities E[C H (G) 2 } and E[C H (G)] 2 . 

Our algorithm is randomized. The output of the algorithm is denoted by Z, which is an unbiased 
estimator of Ch(G), i.e., Ch{G) = E^[Z] (expectation over the coin tosses of the algorithm). As the 
output of our algorithm depends on both the input graph, and the coin tosses of the algorithm, we use 
expressions such as EgfE^fZ]]. Here, the inner expectation is over the coin-tosses of the algorithm, and the 
outer expectation is over the graphs of Q(n,p). Note that E^fZ] is a random variable defined on the set of 
graphs. 

3 Approximation Scheme for Counting Copies 

We define a new graph decomposition technique which is used for embedding the template graph into the 
base graph. As stated earlier our algorithm for embedding works in stages and our notion of decomposition 
captures this idea. 

Ordered Bipartite Decomposition. An ordered bipartite decomposition of a graph H = (Vh, Eh) is a 

sequence V\ , - - - , Vt of subsets of Vjj such that: 

® Vi,...,Vi form a partition of Vh- 

® Each of the V% (for i € [£] = {1, . . . , £}) is an independent set in H. 
® Wi 3j such that v eV implies N H (v) C (|J fe< . V k ) U Vj. 
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Property © just states that if a neighbor of a vertex v E Vi is in some Vj (j > i), then all other neighbors 
of v which are not in V\ U • • • U Vi-i, are in Vj. Property © will be used in the analysis for random graphs 
to guarantee that in every stage, the base graph used for embedding is still random with the original edge 
probability. 

Let V' 1 = U ; . ,\[ r Define 

u i = N H (v i )nv i - 1 . 

Ui is the set of neighbors of Vi in Vi U • • • U V%-\. Define Hi to be the subgraph 
of H induced by Ui U Vi. Let denote the edge set of graph Hi. The width of 
an ordered bipartite decomposition is the size (number of edges) of the largest Hi. 

The C/j's will play an important role in our analysis. Note that given a Uj, its 
corresponding Vj has the property that Vj 2 Nn(Uj) — V 3 ' -1 . Hereafter, when the 
context is clear, we just use decomposition to denote an ordered bipartite decompo- 
sition. In general, the decomposition of a graph needn't be unique. The following 
lemma describes some important consequences of the decomposition. The proofs 
of this section are collected in Appendix lAl 

Lemma 1. Let Vi, . . . ,Vg be a decomposition of a graph H = (Vjj, Eh). Then, the following assertions 
are true, (i) Each of the Ui is an independent set in H (Hi is a bipartite graph), (ii) The edge set Eh is 
partitioned into Eh± , • • • , En t . 

Every graph has a trivial decomposition satisfying properties ® and ©, but the situation changes if we 
add property © (C3 is the simplest graph which has no decomposition). Every bipartite graph though has a 
simple decomposition, but not necessarily of bounded width. Note that the bipartiteness of H is a sufficient 
condition for it to have an ordered bipartite decomposition, but not a necessary one. 

We will primarily be interested in cases where the decomposition is of bounded width. This can only 
happen if A is a constant. In general, if A grows as a function of n, no decomposition could possibly have 
a bounded width (A/2 is always a trivial lower-bound for the width). For us the parameter I plays no role. 



Algorithm Embeddings(G,H) 

Initialize X <- 1, Mark(0) <- 0, ip(0) *- 
let Vi, . . . , Ve denote an ordered bipartite decomposition of H 
for i <— 1 to I do 

let Gf <— G[Vg ~ Mark(i - 1) U <p(Ui)] (Gf is the subgraph of G used for embedding Hi) 
compute Xi, the number of embeddings of Hi in Gf with Ui being mapped by ip 
pick an embedding uniformly at random (if one exists) and use it to update tp 
if no embedding exists, then set Z to and terminate 

X «- X ■ Xi 

Mark(i) <- Mark(i - 1) U <p(Vi) 
Z <- X/aut(H) 
output Z 

The input to the algorithm Embeddings is the template graph H together with its decomposition and the 
base graph G. The algorithm tries to construct a bijection ip between the vertices of H and G. Vi represents 
the set of vertices of H which get embedded into G during the i th -stage, and the already constructed mapping 
of Ui is used to achieve this. For a subset of vertices S C Vh, <f(S) denotes the image of S under ip. If 




6 



X > 0, then the function <p represents an embedding of H in G (consequence of properties ® and ©), 
and the output X represents the inverse probability of this event happening. Since every embedding has a 
positive probability of being found, X is an unbiased estimator for the number of embeddings of H in G 
(Proposition Q]), and Z is an unbiased estimator for the number of copies of H in G. 

The actual procedure for computing the Xj's is not very relevant for our results, but note that the X{S 
can be computed in polynomial time if H has a decomposition of bounded width. In this case the algorithm 
Embeddings runs in polynomial time. 

3.1 FPRAS for Counting in Random Graphs 

Since the algorithm Embeddings is an unbiased estimator, use of Chebyshev's inequality implies that re- 
peating the algorithm 0(e~ 2 E^4[Z 2 ]/E^4[Z] 2 ) times and taking the mean of the outputs results in a random- 
ized approximation scheme for estimating Ch(G). From here on, we abbreviate Ch{G) as C. The ratio 
E^[Z 2 ]/E^[Z] 2 is commonly referred to as the critical ratio. 

We now concentrate on showing that for random graphs the algorithm is an fpras. A few of the technical 
details of our proof are somewhat similar to previous applications of this sampling idea, such as that for 
counting perfect matchings OTlfTTl . The simpler techniques in these previous results, however, are limited 
to handling one edge per stage (therefore, working only when H is a matching). Our algorithm embeds a 
small sized subgraph at every stage. The key for obtaining an fpras is to guarantee that the factor contributed 
to the critical ratio at every stage is very small (which is now involved because it is no longer a simple ratio 
of binomial moments as in OT1 [T71 ). Adding this to the fact that we can do a stage-by-stage analysis of 
the critical ratio (thanks to the decomposition property which ensures the graph stays essentially random), 
provides the ingredients for the fpras. 

The analysis will be done for a worst-case graph H under the assumption that the sizes of the bipartite 
graphs H^s are bounded by a universal constant w, and a random graph G. Here, instead of investigating 
the critical ratio, we investigate the much simpler ratio Eg[E^[Z 2 ]]/Eg[E^[Z]] 2 , which we call the critical 
ratio of averages. We use the second moment method to show that these two ratios are closely related. For 
this purpose, we take a detour through the G(n, m) model. The ratio E[(7 2 ]/E[C] 2 plays an important role 
here and for bounding it we use a recent result of Riordan [33]. The result (stated below) studies the related 
question of when a random graph G is likely to have a spanning subgraph isomorphic to H. 

In the following, N is used to denote (™). We say an event holds with high probability (w.h.p.), if it 
holds with probability tending to 1 as n — > oo. 

Theorem 2. (Riordan H33\l ) Let H be a graph on n vertices. Let en = otN = a(n)N, and let p = 
p(n) £ (0, 1) with pN an integer. Suppose that the following conditions hold: aN > n, and pN, (1 — 
p)y/n, np^ /A 4 — > oo, where 

7 = j(H) = max {maxjei? : F C H, vp = s}/(s — 2)}. 

3<s<n 

Then, w.h.p. a random graph G £ Q(n,pN) has a spanning subgraph isomorphic to H. 

The quantity 7 is closely related to twice the maximum average degree of a subgraph of H. The idea 
behind the proof is to use Markov's inequality to bound Pr[C = 0] in terms of E[C] and Var[C]. The main 
thrust lies in proving that E[C 2 ]/E[C] 2 = 1 + o(l). Now by just following Riordan's proof, we obtain the 
following result. Since the proof reveals nothing new, we omit it in this abstract. 

Proposition 2. Let H be a graph on n vertices. Let en = 01N = a(n)N, and let p = pin) G (0, 1) 
with pN an integer. Let v = max{2,7}. Suppose that the following conditions hold: pN,np u / 'A 4 — > 00 
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and a 3 Np~ 2 — ► 0. Then, w.h.p. a random graph G € Q{n,pN) satisfies E[C 2 ]/E[C] 2 = 1 + o(l). /« 
particular, if H is a bounded-degree graph on n vertices. Then, w.h.p. a random graph G G Q(n,Q(n 2 )) 
satisfies E[C 2 ]/E[C] 2 = 1 + o(l). 

Note that some of the conditions in Proposition [2] are rephrased from Theorem [2] These are the con- 
ditions in the proof of Theorem |2] that are needed for bounding E[(7 2 ]/E[C] 2 . We will be interested in 
bounded-degree graphs H. For a bounded-degree graph H, both A and 7 are constants. Additionally, we 
will be interested in dense random graphs (where the conditions of Proposition [2] are satisfied). Interpreting 
Proposition |2]in the Q (n, p) model by using known results for asymptotic equivalence between Q (n, m) and 
Q(n,p) models (e.g., see Proposition 1.12 of [21]) yields (proof omitted) 

Lemma 2. Let H be a bounded-degree graph on n vertices. Let lo = u(n) be any function tending to 00 as 
n — > 00, and let p be a constant. Then, w.h.p. a random graph G G G(n,p) satisfies C > E[C]/u;. 

Remark: Since C is fairly tightly concentrated around its mean, a rudimentary approximation for C is just 
E[C] = aut(H) ( as Vh = n ) - However, this naive approach doesn't produce for any e > 0, an (1 ± e)- 
approximation for C (see, e.g., |[3Tl[T2l[T5l[32l ). 

Using the above result we investigate the performance of algorithm Embeddings when G is a random 
graph. In this extended abstract, we don't try to optimize the order of the polynomial arising in the running 
time analysis. Even though for simple template instances such as matchings or cycles, one could easily 
determine the exact order. The proof idea is to break the critical ratio analysis of the large subgraph into a 
more manageable critical ratio analysis of small subgraphs. 

Theorem 3 (Main Theorem). Let H be a n-vertex graph with a decomposition of width w (a constant). 
Let Z be the output of algorithm Embeddings, and let p be a constant. Then, w.h.p. for a random graph 
G G Q(n,p) the critical ratio E^[Z 2 ]/E^[Z] 2 is polynomially bounded in n. 

Summarizing, we have the following result: if H has a decomposition of bounded width w, then for 
almost all graphs G, running the algorithm Embeddings poly(ra)e -2 times and taking the mean, results in 
an (1 ± e) -approximation for C. Here, poly(ra) is a polynomial in n depending on w and p. Since each run 
of the algorithm also takes polynomial time (as H has bounded width), this is, an fpras. 

4 Graphs with Ordered Bipartite Decomposition 

We divide this section into subsections based on the increasing complexity of the graph classes. Some of 
the later graph classes include the ones that will be covered earlier. We will prove the following result in the 
remainder of this section. Decomposition of subdivision graphs is presented in Appendix ICl 

Theorem 4. Let H be a graph where each connected component is one of the following: graph of degree 
at most two, tree, bounded-width grid, subdivision graph, C^-free outerplanar graph, [C^,C^\-free series- 
parallel graph, or planar graph of girth at least 16. Then, there exists an ordered bipartite decomposition 
of H. Furthermore, if H has bounded degree, then the decomposition has bounded width. 

From now onwards, we concentrate on connected components of the graph H. If H is disconnected a 
decomposition is obtained by combining the decomposition of all the connected components (in any order). 
We will abuse notation and let H stand for both the graph and a connected component in it. A is the 
maximum degree in H. For constructing the decomposition, the following definitions are useful, 

IP = \J U j} V i = \J Vj, and D i = V i - U\ 
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A = i — ^A^np ^A 

A - = | > A — T - ^"A 

it ' — sJ — ssJ — sJ — ==J 



Figure 2: Decomposition of a cycle, tree, and grid. The graphs are actually undirected. The arrows just connect 
the vertices of Ui to their neighbors in Vi. Let V\ = {si}. All out-neighbors of a vertex are in the same Vi, and 
all in-neighbors of a vertex are in the same Ui. 



4.1 Some Simple Graph Classes 

We start off by considering simple graph classes such as graphs of degree at most two (paths and cycles), 
trees, and grid graphs. Fig. [2] illustrates some examples. 

• Paths: Let H represent a path (si, . . . , Sk+i) of length k 
Vi = { Si } for 1 < i < k + 1. 



k{n). Then the decomposition is, 



Cycles: First consider the cycles of length four or greater. Let s\, . . . , s& be the vertices of a cycle H 
of length k = k(n) enumerated in cyclic order. In the decomposition, V\ = {si}, V2 = {S2, Sfc}, an d 
Vi = {s^ for 3 < i < k — 1. Cycles of length three (triangles) don't have a decomposition, but can 
easily be handled separately (see Appendix iBl). Actually, if H = H\ U H2, where graphs Hi and H2 
are disjoint, Hi has a decomposition of bounded width, and H2 consists of a vertex disjoint union of 
triangles, then again, there exists an fpras for estimating C (the details of this proof is briefly sketched 
in Appendix |B]). This also completes the claim for graphs of degree at most two in Theorem Q] 

Trees: For a tree H, V\ = {si}, where si is any vertex in H. For i > 2, let Ui be any vertex from 
D , then Vi is the set of neighbors of this vertex which are not in V l ~ x . Intuitively, Vi is the set of 
children of the vertex in Ui, if one thinks of H as a tree rooted at si. The width of the decomposition 
is at most A. 

Grids: Let wq be the width of the grid graph H. Set V\ = {si}, where si is any corner vertex in 
H. Later on, Vi is the set of all vertices which are at a lattice (Manhattan) distance i from s%. Since 
for each i, there are at most wq vertices at distance i from from s\, the sizes of the V^'s are bounded 
if wq is bounded. Consequently, the width of the decomposition is bounded if wq is bounded. This 
construction also extends to higher dimensional grid graphs. 
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Outerplanar Graphs 

A graph is outerplanar if it has a planar embedding such that all vertices are on the 
same face. Let H be a C3-free outerplanar graph. The idea behind the decompo- 
sition is that vertices in Ui partitions the outer face into smaller intervals, each of 
which can then be handled separately. 

Before we formally describe the decomposition, we need some terminology. 
Let si,...,s& be the vertices around the outer face with k = k(n) (ordering 
defined by the outerplanar embedding). For symmetry, we add two dummy ver- 
tices sq, Sfc + i without neighbors and define Ui = {sq, Sfc+i}> an d V\ = {si} (the 
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dummy vertices play no role and can be removed before running the algorithm Embeddings). For i > 1, 
two vertices Sj , Sj r with jo < j\, define a stage i interval if Sj Q , Sj 1 £ IP, but for jo < I < ji, s/ ^ U l . If 
the interval is defined it is the sequence of vertices between Sj , Sj 1 (including the endpoints). Let dj be a 
median vertex of / n V % (median based on the ordering), where / is a stage i interval. Define Ui + i as the 
smallest subset of V i containing {aj and also N H (N H (U i+ i) - V { ) n V\ Define V i+X as N H {U i+1 ) - V\ 
We now argue that this is indeed a decomposition. Consider a stage i interval /, with Sj , Sj x as the defining 
end points, and aj as the median of / n V i . The proofs of this subsection are collected in Appendix 10 

Lemma 3. For every i > 1, there is a stage i interval I with t/j+i C /. 

Lemma 4. Let I be a stage i interval with Ui + \ C I. Then, < |/n V i \ < 2 A. 

The properties ® and ® are guaranteed by the construction. Lemma 0] implies that the width of the de- 
composition is most 2A 2 . Property ® holds because there are no triangles in H. See Fig.[3](in Appendix lGl> 
for an example. 

4.3 Decomposition of Series-Parallel Graphs 

A series-parallel graph (also called a two-terminal series-parallel graph) is a graph with two distinguished 
vertices s and t that is obtained as follows. A single edge (s,i) is a series-parallel graph (base case). Let 
H a and be two series-parallel graphs with terminals s a ,t a and Sb, h respectively. The graph formed by 
identifying t a with si, is a series-parallel graph with terminals s a , ti, (series operation is denoted by ©). The 
graph formed by identifying s a with and t a with 4 is a series-parallel graph with terminals s a = and 
ta = tb (parallel operation is denoted by 1 1). 

In the following, the process of adding a vertex to some 14 is referred by the term selecting. We 
say a vertex is finished once it is added to some Ui, i.e., all its neighbors are selected. The algorithm is 
recursive. The construction is technical, but the basic idea is to first finish the terminals, so that the parallel 
components separate (for the decomposition purposes). Then the algorithm finishes some vertex joining two 
serial components. In both these steps the algorithm might be forced to finish some other vertices too. We 
will discuss the rest of the construction in Appendix [Dj We now summarize the result. 

Lemma 5. Let H be a [C3,C^,]-free series-parallel graph. Then H has a decomposition of width 0(A 2 ). 

4.4 Planar Graphs 

Define a thread as an induced path in H whose vertices are all of degree 2 in H. A fc-thread is a thread with 
k vertices. Let H be a planar graph of girth at least 16. We first prove a structural result on planar graphs. 
The proofs of this subsection are collected in Appendix 10 

Lemma 6. Let H be a planar graph of minimum degree 2 and girth at least 16, then H always contains a 
3-thread. 

In order to define a decomposition, we define a 3-thread partition X\ , . . . , X c of a planar graph H as a 
partition of Vh such that each Xi satisfies 

f {aj}, where is a degree or 1 vertex in the graph induced by Vh — Uj<j Xj on or 
\ {cii, bi, Ci}, where a^, bi, Ci form a 3-thread in the graph induced by Vh — Uj<i Xj on H. 

By Lemma[6]every planar graph with girth at least 16 has a 3-thread partition. As earlier, we say, a vertex 
is selected if we add it to some 14. Using the 3-thread partition (which can be constructed using Lemma[6]), 
a decomposition of a planar graph of girth at least 16 can be constructed by repeating this following simple 
procedure, 
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i. Find the largest index I such that Xi contains a vertex z\ which has not yet been selected, but is 
adjacent to an already selected vertex. 

ii. Define U t = N H (zi) n D*~ l and V = N H (Ui) - V*' 1 . 

iii. Increment i. 

Lemma 7. Let H be a planar graph of girth at least 16. Then the above procedure finds a decomposition 
of H of width at most 2A. 

5 Negative Result for Ordered Bipartite Decomposition 

As mentioned earlier only graphs of bounded degree have a chance of having a decomposition of bounded 
width. So a natural question to ask is whether all bounded-degree graphs with a decomposition have one of 
bounded width. In this section, we answer this question negatively by showing that every unbounded-width 
grid graph fails to satisfy this condition. For simplicity, we will only consider y/n x y/n grid graphs, but 
our proof techniques extend to other cases as well. The proofs of this section are collected in Appendix [Gj 

Let H = (Vh,Eh) be a y/n x y/n grid graph with Vh = {(i,j) '■ < i,j < y/n — 1} and 
Eh = {((*>j)) (}'>?)) '■ i = i' and \j — j'\ = lor \i — i'\ = 1 and j = j'}. We now show that any 
decomposition of H has a width of at least y/n. Let V\ , . . . , Vg be any decomposition of H. Consider any 
2x2 square of H defined by vertices a, b, c, d. The two neighbors a, b of the vertex c with the smallest label 
I always have the same label I' > I. The fourth vertex d has any label I" with I" > I and I" ^ I'. We define 
a new graph H' = (Vh, Eh>) on the same set of vertices by putting the edge (a, b) into Eh>- Note that all 
vertices in a connected component have the same label thus are chosen together (appear in the same Vfe). 

Let TLd be a class of graphs on vertex set Vh with exactly one diagonal in every 2x2 square (and no 
other edges). That is any graph Ho = (Vh, Ed) from TLd has for every (i,j) with < i, j < y/n — 2 
exactly one of the edges ((i,j), (i + 1, j + 1)), ((i,j + 1), (i + 1, j)) in Ed and no other edges are in Ed- 
Note that H' G TLd- The following theorem shows that any graph Hd £ TLd has the property that there is 
a connected component touching top and bottom or left and right. 

Theorem 5. There exists a connected component of Hd that contains at least one vertex from every row or 
at least one vertex from every column. 

Corollary 1 (Negative Result). There exists no decomposition of a y/nx y/n-grid graph H of width y/n— 1. 

6 Conclusions and Open Problems 

The natural question arising from this work is what other classes of graphs have an ordered bipartite de- 
composition and more importantly which of them have one of bounded width. Other than the graph classes 
mentioned above, the bounded-degree [C3, Cs]-free Halin graphs |[T8l where degree two vertices are allowed 
and hexagonal grid graphs are some other interesting graph classes which have bounded width decomposi- 
tions. Another interesting problem would be to investigate the general complexity of the ordered bipartite 
decomposition and possibly characterize its relation to other existing graph decomposition schemas. The 
notion of bounded width decomposition is a natural sufficient condition for the class of algorithms based 
on the principle of the function Count to give almost always an fpras. But the necessary condition for the 
general approach to work is still unclear. Finally, a challenging open problem is to obtain any such general 
result for counting in arbitrary dense graphs. 
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A Proofs from Section |3] 

Proof of Lemma[TJ For part (i), assume otherwise. Let (u, v) be an edge in H with both u, v G f/j. Let u 
appear in some Vj (J < i) and v appear in some Vk (k < i). Property © implies that j ^ k. Assume without 
loss of generality that j < k. Property © implies there exists no vertex w G Nh(u) such that w G Vi. 
Therefore, u ^ U-i. Contradiction. Additionally, since each of the Ui and Vi is an independent set, each of 
the graph Hi is bipartite. 

For part (ii), first note that due to properties ® and ©, the C/j's are pairwise disjoint (but they do not nec- 
essarily form a partition). Therefore, the En i 's are also pairwise disjoint. Now since for every edge (n, v) 
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there exist a j, k such that u G J7j and v G and without loss of generality j < k. Then, u G Uk and 

(u, v) G Thus, , . . . , £^ form a partition of □ 

Proof of Theorem |3l We first relate the critical ratio to the critical ratio of averages. As the estimator is 
unbiased = C. Therefore, from Lemma[2j 



Squaring both sides, 



2 . Wf 

^L Z J = — TTUvI - 



aut(H) 2 ~ uj 2 aut(H) 2 



Note that E A [X]/aut(H) refers to the expected output for fixed graph G, and the inequalities hold for almost 
all such graphs G, while Eg[E A [X]]/ aut(H) is the expected output for a random graph G G Q(n,p). 

The numerator of critical ratio of averages, Eg[E_4[Z 2 ]] = Eg[E A [X 2 ]] / aut(H) 2 . Using Markov's 
inequality 

Pr[E^[Z 2 ] > u;Eg[E A [Z 2 }]} <- n ^0 

UJ 

Using the above inequalities yields 

E A [Z 2 ] _E A [X 2 ] 3 E g [E A [Z 2 }] _ 3 E g [E^[X 2 ]] 

^ UJ : : — 7777 — UJ 



E A [Z] 2 E A [X} 2 ~ Eg[E A [Z}] 2 Eg[E A [X}} 2 - 

Now, we just concentrate on bounding the critical ratio of averages. Let V\, . . . , Vt denote a decompo- 
sition of H of width w. In the bipartite graph Hi = (Ui, Vj, Ejji), let ej = l-E^J, «j = \V%\, and it, = |J7j|. 
Let rii = n — ^j<» v r We wm re ly on tne ^ act ^at an the Hi's are of bounded width. 

Let = rii+Ui. Let Gi be a random graph from Qin'^p) with Uj distinguished vertices. Let L H .\u.(Gi) 
denote the number of embeddings of Hi in Gi where the mapping of the vertices in Ui to the distinguished 
vertices in Gi is fixed (given). The results don't depend on the mapping used for Ui. We abbreviate 
Affile by Li. 

First we investigate the numerator of the critical ratio of averages. Here we use the fact that 

Eg[E A [X 2 ]] = E g [E A [X 2 ]] E g [E A [X 2 ]]. 

The previous equality arises, because at the i th -stage, the graph used for embedding Hi is from G(n'i,p) 
irrespective of the choices made over the first (i — l)-stages. This is guaranteed by property ® of the 
decomposition and in turn it allows us to perform a stage-by-stage analysis of the critical ratio. 

Furthermore, Eg[E^[X 2 ]] = E[L 2 ] (as the graph is random, it doesn't matter which vertices U gets 
mapped to). Next we investigate the denominator of the critical ratio of averages. Here we use the fact that 

E g [E^[X]] 2 = (n\p^) 2 = E[L X ] 2 E[L,] 2 . 

Therefore, the ratio 

Eg[E A [X 2 ]] Eg[E A [X 2 }] Eg[E A [X 2 }] 



Eg[E A [X}} 2 EgpEUpfi]] 2 Eg[E^[X,]] 2 

nLl) E[L 2 ] AE[L 2 ] 



E[L,] 2 11 E[L 



n 
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To bound this expression we investigate the parameter Var[Li]. 

Now consider a complete bipartite graph K Ui>ni with one side being the U{ distinguished vertices of Gi 
and the other side being the remaining (non-distinguished) vertices of G{. Let T^.^ijiK^^ be the set of 
embeddings of Hi in K Ui>n . where the mapping of the vertices in Ui to the vertices in K Ui>ni is fixed as in Gi. 
For each embedding / from J^jj.ijj^K^^) define the indicator random variable IffHA = 1[/C^i) Q Gi]- 
For each F C Hi, let ep be the number of edges in F, and let tf be the number of vertices in F which belong 
to Vi. Now there are Q{n 2Vl ~ rp ) pairs (f,g) of embeddings of H { in F Hl \uS K ^,^) witn f{ H i) n 9 {Hi) 
isomorphic (~) to F. In the following, we use A X B for ^4 = @(B). 



Var[Li] = ^2Cov[I f{Hi) ,I giH . } ]= E[I /(JJi) I fl(Hi) ] - E[/ /(//i) ]E[/ 9(Hi) 

/.9 f,9 

= E E E [ J /("i) W " E[// (ffl) ] E [W 



FCHi,e F >0 f,g 

f(Hi)ng(Hi)~F 



^2 P 2e *- eF -p 2e ^ Yl ^~ rF Cp 2ei ~ eF 



p 2 ^ 



FCHi,e F >0 f,g FCHi,e F >0 

f(Hi)ng(Hi)~F 

E[L t ] 2 e E[L,] 2 e 

= > — k; (l-p eF )x > r F (1 - P ) x max ' (1 - p ep ). 

FC J f/ i ,e F >0 1 F FCHi,e F >0 1 F 1 1 

The second equality used the fact that random variables IfiHA an d IgiHA w& independent if Efr Hi \ n 
Eg/fjA = 0. The implicit constants in the above equivalences depend on the width of Hi (a constant), but 
are independent of n,;. The quantity 

(1 -p e F\ 

max — =- = Ofl/n,) (rj? = 1, provides the maximum). 

FCH it e F >0 n\ F p e F 

Therefore, Far^/E^] 2 = 0(1/ rn), implying E[L 2 ]/E[L;] 2 = l + 0(l/ni). If e; = 0, then Far[Li] = 
0, and E[L 2 ] = E[Lj] 2 . Putting everything together, we obtain 

^A[X 2 ]] yr E[£fl jr( c 
Eg\E A [X]]* l\nL^ -11V + 



for constant c depending only on w and p. Since n = n\ > n 2 > • • • > ri£, Y\ i=1 (l + — ) can be 
polynomially bounded (to 0(n c )) by a telescoping argument. □ 

B Extension to Disjoint Triangle Case 

For simplicity, we will discuss only the case where H is a union of n/3 vertex disjoint triangles. Even though 
H doesn't have a decomposition, there is a simple fpras for counting copies of H in random graphs. Exten- 
sion to other possible cases of template graphs with triangles in them as permissible under the conditions 
of Theorem [T] is straightforward. Let si, . . . , s n be the vertices in H, with every triplet s^x, s^ + 2, S3«+3 
forming a triangle in H. 

Let Z = X/aut{H) be the output of the algorithm Embeddings for inputs H and G € G(n,p = 
constant), but where each V{ = {si} and t = n (even though V\, . . . , Vi is not an ordered bipartite decom- 
position). As in Theorem[3j we will again investigate the ratio Eg [E_4 [X 2 ] ] /Eg [E_4 [X] ] 2 which equals the 
critical ratio of averages. 
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The numerator, 

E g [E A [X 2 ]] = Eg[E A [X 2 X 2 X 2 }} = Eg[E A [X 2 X 2 X 2 }} E^LY^X^^]]. 

The last equality follows because after embedding each triangle the subgraph of G into which nothing has 
been embedded yet is random with the original edge probability p. Consider a representative term from this 
product, 

Eg[E A [X 2 i+1 X 2 i+2 X 2 i+3 }\ = Eg[E A [X 2 i+1 ]]Eg[E A [X 2 i+2 X 2 i+3 }} = (n-3i) 2 Eg[E A [X 2 i+2 X 2 i+3 }\. 

Here, as earlier, we relied on the fact the graph into which we embed the vertex is random. Let 

m = X 3 i + 2 and m' = -^34+3. Therefore, m denotes the number of ways of embedding the vertex s%i + 2 and 
m! denotes the number of ways of embedding the vertex £31+3. Since the number of edges incident on the 
vertices in G is binomially distributed, Eg[E^[X| i+1 X| i+2 X| i+3 ]] equals 

(n _ s) 2 "!; 1 ™ 2 (j^ ™ ,2 ( m ~ l \ m \i - P ) m - i - m ') ( n ~^ n ~ 1 ) P m (i- P ) n - 3i - 1 - m . 



m=0 



Let Li denote the number of embeddings of a triangle in a random graph from Q(n — 3i,p). Then, the 
denominator 

Eg[E A [X]} 2 =E[L } 2 EfL^] 2 . 

Note that E[Lj] = ( n ^)3\p 3 . Using the above equalities, the critical ratio of averages can be bounded to 

Eg[E A [X 2 }] "fi 1 Eg[E A [X 2 l+l X 2 l+2 X 2 l+3 \] "fi 1 / c 
Eg[E A [X]]i 11 E[Uf ~ 11 V n-3 i 

for a constant c. Again, we obtain a polynomial bound on the critical ratio of averages, which translates to 
an fpras. 

C Decomposition of Subdivision Graphs 

A /c-subdi vision graph of a graph is obtained by inserting k = k(n) new vertices in every edge, that is by 
replacing each original edge by a path of length k + 1. We relax this definition and say that a ^-subdivision 
graph is the graph obtained by inserting at least one and at most k vertices in every edge. Let H be a 
fc-subdivision graph of a graph F. We now show that H has a decomposition of width at most A. 

The main idea behind the decomposition is that as soon as a vertex v of F appears in some Vj, all 
vertices in Njj(v) not in V"- 7 are selected in Vj+i, i.e., v G Uj+i. The decomposition of H can be formally 
defined as, 

{si} where si any vertex in Vf, if i = 1, 
Vi = < N H { ai ) - V*- 1 , if i > 2 and {a,} = V F D D i_1 / 0, 
N H (bi) - V^ 1 where b, L € D*' 1 , otherwise. 

We now argue correctness of the decomposition for which the following lemma is useful. 

Lemma 8. There exists at most one vertex in Vf H D' 1 for all stages i in the decomposition. 

Proof. Proof by induction. True by construction for i = 1. Now consider some j* ft -stage. By the inductive 
hypothesis, VfHD^ 1 has at most one vertex. If there is a vertex, then a,j is this vertex. In this case, Nn(aj) 
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doesn't contain any vertex from Vp (subdivision property). Otherwise, bj ^ Vp, therefore, there is at most 
one vertex of Vf in Vj (subdivision property). Therefore, in both cases, \Vp fl D 3 \ < 1. □ 

It can easily be verified that all properties of the decomposition are satisfied and the width of the decom- 
position is at most A. A direct consequence of this result is that there are some special types of expanders 
which have a decomposition of bounded width. This is because any graph resulting from the constant sub- 
division of a constant-degree expander would still be a constant-degree expander. For example, consider the 
1-subdivision graphQ S(H) of constant degree expander H. S(H) has an ordered bipartite decomposition 
of bounded width. So the only fact that remains to be verified is that vertex expansion ratio of S(H) is a 
constant. 

Lemma 9. A 1-subdivision graph of a constant-degree expander is an expander. 

Proof. Let A be a set of vertices in H. Let a (= constant) denote the vertex expansion ratio of H and A 
denote the maximum degree in H. Let S(H) denote the 1-subdivision graph of H. Let B be a subset of 
vertices from N S ^ H ^(A). We consider the vertex expansion ratios for two different scenarios of B. 

• Case B = 0. In this case, \N S(H) (A)\ >\N H (U)\> a\A\. 

• Case B / 0. First assume that, B = N S ( H )(A). Under this assumption, N S ( H )(A U B) = N H (A). 
Say \N H (A)\ = A. Now even if B C N S ^(A), \N S ( H) (A U B)\ is at least A. Therefore, 

\N S(H) (A U B)\ = A > a\A\ > ^j(\A\ + \B\). 

The final case to consider involves a set of vertices C in S(H), which are not in H. In this case, \N S ( H ^(C)\ > 
\C\/A. 

From the above case analysis it is clear that the vertex expansion ratio of S(H) is a constant, and the 
proof follows. □ 

Remark. Most of the graph classes we considered appear to have small treewidth. So a natural question 
would be to relate these two decomposition schemes. However, treewidth and the width of ordered bipartite 
decomposition are incomparable. In one direction, consider a star graph. Treewidth is 1, but no ordered 
bipartite decomposition of width less than n/2 exists. For the other direction, consider the 1-subdivision 
graph of Lemma[9] S(H) has constant expansion which implies a treewidth G(n) ll20l . 

D Decomposition of Series-Parallel Graphs 

To define the decomposition we need some more terminology. Let H = (Vh, Eh) be a [C3,Cs]-free series- 
parallel graph with (distinguished) terminals s, t. Let Vh = Vl h, V^h , . . . denote a decomposition of H. 
Let V H = Uj<j Vj,H- For a set of vertices S in H define 

D H (i, S) = {u G V^f 1 : there exists v G S such that (u, v) G E H }- 

Dn{i, S) represents the set of neighbors of S in H selected in the first (i — l)-stages. The algorithm starts 
by finishing s and t as follows. 

V ltH = {s}, and V 2 , H = N H (s)-V H . 

= f {t}UN H (D H (3,{t}))-V H ,ift^V H , 
3 ' H \ 0, otherwise. 

V 4 , H = N H (t)UN H (D H (4,N H (t))) - V H . 
'The results also extends to any constant (not necessarily 1) subdivision graph of a constant-degree expander. 
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In words, the above four stages achieve: (i) select s, (ii) finish s, (iii) select t unless already selected, (iv) 
finish t. Define 

Vh = Vl,H, V2,H, V3 t H, V^h,Vh\sJ 

where V^| s t is defined recursively as: 

i. Base case: If all the vertices in H are selected, Vn\s,t = 0- 

ii. Parallel case: If H = H a \ \Hb, find recursively Vn a \ s ,t an d Vg b \ 8i p Define 

^H\s,t = VH a \s,t, ^H b \s,t- 

iii. Serial case: If H = H a © Hi,, with x as the vertex joining H a and Hi,. Let s G Vn a and t G Vn b - 

(a) If x is finished, define V H \ Stt = V Ha \ s , x , V Hb \ x ,t- 

(b) If x G V# (x has already been selected) and x not finished, then finish x. This produces the set 

V 5 , H = N H (x) U N H (D H (5, N H (x))) - V*. Define V H ^ t = V 5 , H , V Hals>x , V Hb{t;X . 

(c) Otherwise, first select x which produces the set V^h = { x } U Nh(Dh(5, {x})) — V^. Then, 
finish x. This produces the set V^ H = N H {x) U N H (D H (6, N H (x))) - Vfj. Define V H \ s ,t = 

V5,H, V^H, ^H a \s,x, ^H b \t,x- 

The properties ® and (D are guaranteed by the construction. The following lemma proves that the V^'s 
form an independent set and provides bounds on the sizes of f/j's. The proof looks at two possible situations, 
conditioning on the presence or absence of paths of length 2 or 3 between s and t. Since both C3 and C5 are 
forbidden, there can either be a path of length 2 or 3 between any two vertices, but not both. This fact will 
be crucial for implying property ®. See Fig. [3] (in Appendix iGb for an example. 

Lemma 10. Let H be a [C3,C^,]-free series-parallel graph with terminals s,t. If there exists no path of 
length 2 or 3 between s and t, and s, t are the only vertices finished, then the above algorithm finds a 
decomposition of H by finishing 0(A 2 ) vertices in every stage. Else if there exists a path of length 2 or 3, 
and s, t and possibly some vertices in Nh(s) U Nh(Nh(s)) are the only vertices finished, then the above 
algorithm finds a decomposition of H by finishing 0(A 2 ) vertices in every stage. 

Proof. The proof is inductive. Let F denote a graph with terminals s', t' appearing in some stage of the 
recursive algorithm. The algorithm always first finishes s' and then t', and once s' and t' are finished the 
parallel components can be handled independently for constructing the decomposition. In the process of 
finishing t' , the algorithm could possibly finish some vertices in Np(s') U Nf(Nf(s')). Hence, in each 
of the parallel components F', terminals s',t' and possibly some vertices in Npi(s') U Nfi(Nf'(s')) are 
finished. Therefore, inductively a decomposition can be obtained. So the challenging case is when F has 
just one parallel component. Let F = F\ © F2 with z as the vertex joining F\ and F2. There are three 
different cases. In each of them the interesting event occurs after s', t' , and z are finished, which splits F 
into Fx and F%. Afterwards, decomposition on F\ and F2 could be constructed independently. 

In the following, we describe the cases under the assumption that there exists no edge between s' and 
t'. If there exists such an edge, then the description would remain the same except that the stage where t' is 
selected would no longer exist (t' is now selected when s' is finished). Also if there is an edge between s' 
and t', then there exists no path of length 2 between s' and t', as, otherwise there would be a triangle. 

Case 1: No path of length 2 or 3 between s' and t'. Note that at the stage when s' is finished no other vertex 
in F is finished. Later, when t' is selected the only vertices in Np(s') that finish at that stage are those which 
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are neighbors of t' (this set is as, otherwise, there would be a path of length 2 between s', t'). Similarly, at 
the stage when t' is finished the only vertices in N F (s') that finish are those which share a common neighbor 
with t' (this set is also as, otherwise, there would be a path of length 3 between s', t'). Now at the stage 
when z is selected some vertices in Np(s') and N F (t') could possibly be finished, and at the stage when z 
is finished some vertices in N F (s') U N F (t') U N F (N F (s')) U N F (N F (t')) could possibly be finished (this 
supplies the 0(A 2 ) bound). However, as soon as z is finished, the graphs Fx and F2 can be handled indepen- 
dently. Now F\ is a smaller series-parallel graph with terminals s' , z, where s', z and possibly some vertices 
in N Fl (s') U N Fl (N Fl (s')) are finished. Therefore, inductively a decomposition of Fx can be completed. 
Similarly, F2 can be viewed as a series-parallel graph with terminals t', z. In F2, terminals t', z and possibly 
some vertices in Np 2 (t')UNp 2 (Np 2 (t')) are finished. Therefore, inductively a decomposition of F2 can also 
be completed. 

Case 2: Paths of length 2 between s' and t'\ So there is no path of length 3 s ~~ ^ s ~~ "* 

between s and t . If r has been selected before s is finished, then t is finished k—i ¥ •.»/ 

together with s (at which stage z is also selected). Note that s and i can be fin- ^ s ^ / 
ished in the same stage because there is no path of length 3. At the stage when z is finished some vertices 
in N F (s') U N F (t') could possibly be finished. Afterwards, we can invoke induction on both Fx and F2. If 
s' is finished before selecting t', then z is finished while selecting t'. At the stage when z is finished some 
vertices in Np(s') could possibly be finished. Later, at the stage when i! is finished some vertices in N F (z) 
could possibly be finished. But again after t' is finished, we can invoke induction on both Fx and F%. 

Case 3: Paths of length 3 between s' and t': So there is no path of length 2 y - ~~~ - ^ 

between s' and t'. There are two sub-cases based on the distance from s' to z. First /'' 'w, "*•./•** "\ /" "N 

*—> — — * K' 

assume that this distance is one. At the stage when s is finished z is selected. At „•**'*?. ../' ..-/ 
the stage when t' is selected no vertex in F is finished (absence of path of length v ~. _ _ - ' 

2). At the stage when t' is finished, z is finished and also some other vertices in Np(s') could possibly be 
finished. Hereafter, induction can be invoked over Fx and F2. 

Now if the distance between s' and z is two. Then, the distance between t' and -- ' ~ "~ ^ 

^ \ 

z is one. At the stage when s is finished no other vertex in F is finished. At the /•'' "\ /" \ /'*" F ''\ 
stage when t' is selected no vertex in F is finished. At the stage when t' is finished, \ * 

z gets selected and some vertices in Np(s') would be finished. Finally, at the stage ^ ^ ^ ' 

when z is finished some vertices in Np(s') U N F (N F (s')) U N F (t') could possibly be finished. Hereafter, 
induction can be invoked over Fx and F2. 

Therefore, a decomposition of H can be obtained with no more than 0(A 2 ) finishing at each stage, i.e., 
every \Ui\ = 0(A 2 ). A more precise upper bound of 2 A 2 can be obtained by a more careful analysis. Also 
as revealed by the proof (unlike triangles) not all pentagons are malevolent for the decomposition. □ 

E Decomposition of Outerplanar Graphs 

Proof of Lemma|3j {Jj+i can only contain vertices that have a path to dj not containing any vertex in U % . 
Since the graph is outerplanar, any path from a, to any vertex w £ I passes through either of the endpoints 
(Sj , s 3l ), both of which are in U l . Hence, Ui + x Q I- Q 

Proof Sketch of Lemma U The first inequality follows as Ui+x ^ V % (definition) and Ui + x C I (Lemma 
[3]). For the second one we use induction over i. The interval I is split into several new intervals (at least two 
as dj G Ui + x) by the vertices of Ui + x, which define the stage i + 1 intervals. The newly created interval are 
of two types: (a) both its end points are from Ui + x, (b) one end point is from Ui and other is from Ui+x- I n 
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the intervals of the first type there are at most 2A vertices from V l+1 (at most A from each of the endpoints) 
and no vertex from V % . In the intervals of the second type, there are at most A new vertices adjacent to the 
endpoint in Ui + \ and at most A old vertices from V % (from the inductive hypothesis and the fact that ai is 
the median). Therefore, each of the newly created stage i + 1 intervals have at most 2 A vertices from V' L+1 . 
□ 

F Decomposition of Planar Graphs 

Proof of Lemma |6j Assume without loss of generality that the graph H is connected, otherwise it is 
sufficient to prove the statement for each of the components. Let H be the graph obtained from H by 
contracting all degree 2 vertices. Then H is a planar graph of minimum degree 3. 

We first show that H contains a face of degree 5 or less. For contradiction, suppose that all the faces have 
degree at least 6. Let n be the number of vertices, m be the number of edges, and k be the number of faces of 
H. Moreover, let F be the set of faces and V the set of vertices of H. Since the degree of each face is at least 
6 (remember that the degree of a face / is the number of edges going around /), 2m = Ylf eF ^ e s(/) — 
Moreover, 2m = Ylv&v deg(v) > 3n, since the minimum degree in H is at least 3. By Euler's formula and 
the previous inequalities: m + 2 = n + k< (2m)/3 + m/3 = m. A contradiction. 

Let / be a face of H that corresponds to a face of the degree 5 or less in H. Since the degree of / is at 
least 16 (the girth is 16), it is easy to see that / contains a 3-thread in H. □ 

Proof of Lemma |7J The properties ® and © of the decomposition are satisfied trivially, whereas the 
property © is satisfied because of the girth restriction on H. 

Let Xi, . . . , X c be a 3-thread partition of H. Let Hi be the graph induced by Vu — Uj<i Xj on H. The 
first observation is that a vertex in any Xj (1 < j < c— 1) has at most one edge connecting it to the vertices 
in Xj + i U • • • L)X C . Consider some stage % of the decomposition. Let I be the largest index with an unselected 
vertex z\. From the previous observation it follows that vertices in N(zi) that are in X\ U • • • U X\_\ are not 
selected, in stages 1 to i — 1. Assume otherwise. Let u be a vertex belonging to N(z{) D Xir(l' < I) that is 
selected in the first i — 1 stages. Then u needs to have a neighbor in X\ U • • • U X c — {zi}, a contradiction 
since it would imply that u (which is in X//) has two neighbors in X\ U • • • U X c . Therefore, till stage i 
none of the neighbors of z\ in X\ U • • • U X\-\ have been selected. By definition of threads zi could have 
at most two neighbors in Hi. The cases where it has two neighbors are one of the following: (a) z\ has one 
neighbor from Xi + \ U • • • U X c and another from Xi, or (b) z\ has both its neighbors from X\. This implies 
that \N H (zi) n D*- 1 ] = \Ui\ < 2, and |K| < 2A. □ 

G Proof of Theorem |H 

Proof of Theorem |5j Assume Hn does not have a connected component that contains a vertex of every 
row. Let Hjj = (Vjj, Ey) be the subgraph of Hp generated by all the vertices connected to the top row, i.e., 
Hjj is a collection of those connected components in Hp that have at least one vertex from the top row. By 
assumption, Hjj does not contain any vertices from the bottom row. 

For every 2x2 sub-grid of H with vertices a, b, c, d and edge (a, b) G E' , we call (a, b) a boundary 
edge if exactly one of c, d is in Vjj and neither of a or b are in Vjj. Let Hb = (V — Vjy, Eb) be the subgraph 
of Hb, where Eb is the set of boundary edges. We assign the color red to all the vertices in Vjj and color 
black to all the vertices in V — Vjj. Over the following two claims we make some observations about the 
structure of Hb- For a vertex v, let c{v) indicate whether the vertex is colored red (r) or black (b). 

Claim 1. There are no degree 3 vertices in Hb, i.e., all vertices in Hb have degree 0, 1, 2, or 4. 
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Proof. Assume to the contrary. Let u be a degree 3 black vertex. Let (u, v\), (14,1*2), (u, 1*3) be the 
only edges incident on u. Trivially c(it) = b,c(vi) = b, c(i?2) = b,c(vs) = b, 0(^4) = 6. For the 
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other vertices, there are only two possibilities: (i) c(w\) = b,c(w2) = r, 0(103) = b, 0(104) = r and (ii) 
c(w\) = r, c(u)2) = b, 0(103) = r, 0(104) = b. As all the edges in Hp are all either between two red vertices 
or two black vertices and every 2x2 sub-grid has exactly one edge, V4 is black and there exists an edge 
between (it, 1)4) in Hb- Therefore, every vertex in Hb has degree either 0, 1, 2, or 4. □ 

As in the previous claim, by considering all possibilities for the neighbors of u being in Vjj or not, 
one can conclude immediately that all vertices of degree 1 are on the left or right border and there are odd 
numbers of degree 1 vertices on each border. 

Claim 2. All the degree 1 vertices oJHb are either on the left or the right border of the grid H. Additionally, 
there is an odd number of degree 1 vertices of Hb on the left and on the right border. 

Every connected component in Hb has an even number of degree 1 vertices. From Claim |2j we know 
that degree 1 vertices only occur at the left and right boundary of Hb and there are odd number of them 
on both boundaries. Putting these two statements together implies that there exists a component in Hb 
(therefore, in Hb) that connects the left and the right border. See also Fig. [3] □ 
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Figure 3: The first graph is outerplanar and the second is series-parallel. Vertices with label i constitute Vi. Neighbors 
of Vi with lower labels constitute Ui. The third figure illusttates the negative result. The dotted diagonal lines are the 
edges in Hjj, and the solid diagonal lines are the edges in Hb- There exists a component in Hb than spans from the 
left to right boundary. 
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